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Abstract 


The continuous mobilization of vehicles has led to a surge in the number of 
road accidents across the world. To get better at this, the government is trying 
to focus on the safest and most preventive measures in traffic. The practice of 
direct observation is found to be time taking and a lot of human effort is needed. 
So, our main idea is to introduce a helmet and license plate detection mech- 
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gorithms; anism. This project attempts to implement a detection process through a few 
OCR; machine-learning algorithms by using predefined libraries. This system notices 
R-CNN; a person with/without a helmet thereby imposing fines on the detected candi- 
RPN; date’s license plate. Further, this research work concludes that the automatic 
— identification of helmets can overcome the challenges faced by the manual data 
KNN classifier collection process. Moreover, this project work assumed that, through data col- 


lection, the algorithm can help to track helmet use and promote its active use 
by people to ensure road safety. 


with its own mass brain continues to be in motion 
until the object hits inner part of the skull. Some- 
times this type of head injury may be fatal in nature. 
In such times helmet acts as life savior (Li et al.) . 
Helmet reduces the chances of skull getting decel- 
erated, hence sets the motion of the head to almost 
zero. Cushion inside the helmet absorbs the impact 
of collision and as time passes head comes to a 
halt. It also spreads the impact to a larger area, thus 
safeguarding the head from severe injuries. More 
importantly it acts as a mechanical barrier between 
head and object to which the rider came into con- 
tact (Vishnu et al.) . Injuries can be minimized if 
a good quality full helmet is used. Traffic rules 
are there to bring a sense of discipline, so that the 
risk of deaths and injuries can be minimized signif- 
icantly. However strict adherence to these laws is 
absent. Hence efficient and feasible techniques must 


1. Introduction 


All over the world around 1.35 million lives are 
lost each year, 50 million people are getting injured 
due to road accidents, according to a report titled 
“ The Global status Revised Manuscript Received 
on December 05, 2019 report on road safety 2018” 
released by world health organization. It is very hard 
to imagine that this burden is unevenly borne by 
motorcyclists, cyclists, and pedestrians. This report 
noted that a comprehensive action plan must be set 
up in order to save lives (Silva and Romuere) . Two- 
wheeler is a very popular mode of transportation in 
almost every country. However, there is a high risk 
involved because of less protection. When a two- 
wheeler meets with an accident, (Doungmala and 
Klubsuwan) due of sudden deceleration, the rider 
is thrown away from the vehicle. If head strikes 
any object, motion of the head becomes zero, but 


OPEN ACCESS 242 


Helmet Detection and Number Plate Recognition Using Machine Learning 


be created to overcome these problems. To reduce 
the involved risk, it is highly desirable for bike- 
riders to use helmet (Yadav and Singh) . Worrying 
fact is that India ranks in top as far as road crash 
deaths are considered. (Mahesh and Baskar) Rapid 
urbanization, avoiding helmets, seat belts and other 
safety measures while driving are some of the rea- 
sons behind this trend according to analysis done by 
experts. In 2015 India signed Brasilia Declaration 
on Road Safety, where India committed to reduce 
road crash deaths to 50 percent by 2020. (Suresh 
et al.) Observing the usefulness of helmet, Govern- 
ments have made it a punishable offense to ride a 
bike without helmet and have adopted manual strate- 
gies to catch the violators (Priya et al.) .However, 
the existing video surveillance-based methods are 
passive and require significant human assistance. In 
general, such systems are infeasible due to involve- 
ment of humans, whose efficiency decreases over 
long duration. (Chirag et al.) Automation of this 
process is highly desirable for reliable and robust 
monitoring of these violations as well as it also sig- 
nificantly reduces amount research has successfully 
done this work based on CNN,R-CNN,HoG,HaaR 
features,etc. (Vishnupriya et al.) But these works are 
limited with respect to efficiency, accuracy or the 
speed with which object detection and classification 
is done. 


2. Literature Survey 


Literature survey is mentioned in Table 1. 


Extract characters 
from number plate 


FIGURE 1. Phases of Development 


2023, Vol. 05, Issue 05S May 


3. Methodology 
3.1. Phases of methodology 


1. Designing a module for functions to detect the 
helmet in the frame. 

2. Designing a module to detect the number plate 
and extract the vehicle number from frame. 

3. Connecting all the modules together and test- 
ing the integrity and accuracy of the system. 


3.2. Phases of Development 


1. Taking video or camera as input. 

2. Taking single frame from that input. 

3. Checking if that frame contains a helmet. 

4. If the helmet is present then going back to 2nd 
stage 

5. If helmet is not present then giving this frame 
to the function which detects number plate and 
extracts characters from it. 

6. Repeating this procedure till the input is not 
empty/null. 


3.3. Implementation 


Using YOLOv3 The YOLOv3 algorithm first sepa- 
rates a fame into a grid. Each grid cell predicts some 
number of boundary boxes (sometimes referred to 
as anchor boxes) around objects that score highly 
with the aforementioned predefined classes. Each 
boundary box has a respective confidence score 
of how accurate it assumes that prediction should 
be and detects only one object per bounding box. 
The boundary boxes are generated by clustering the 
dimensions of the ground truth boxes from the orig- 
inal dataset to find the most common shapes and 
sizes. The object detection problem is treated as a 
regression problem in the YOLO algorithm and the 
image is divided into an S x S grid. If the cen- 
tre of a target falls into a grid, the grid is respon- 
sible for detecting the target. Each grid will out- 
put a bounding box, confidence, and class probabil- 
ity map. Among them. The bounding box contains 
four values: x, y, w, h, (x, y) represents the cen- 
tre of the box. (W, h) defines the width and height 
of the box. Confidence indicates the probability of 
containing objects in this prediction box, which is 
the IoU value between the prediction box and the 
actual box. The class probability indicates the class 
probability of the object, and the YOLOv3 uses a 
two-class method. 
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TABLE 1. Literature Survey 


YEAR TITLE NAME AUTHOR SURVEY 
NAME 
2021 Real-time Helmet Detection and Y. Zhang and Y.Du Faster R- CNN object 
Tracking System for Motorcycle detection algorithm , a 
Riders multi- object tracking 
algorithm. 
2021 A Survey on Helmet Detection N.G. Cuong, Deep learning. 
Methods in Motorcycle Riders N. K. Quoc, and 
for Intelligent Transportati on L. K. Son 
System 
2020 Helmet Detection and Violation N. H. Tran, YOLOv3 object detec- 
Analysis in Motorcycle Riders T. Q. Nguyen, and K. T. tion algorithm, a classi- 
Using Deep Learning Techniques Nguyen ficatio n model 
2020 Helmet Net: Efficient Helmet V.R. Suwardi, CNN model, a classifi- 
Detection for Riders Using Con- A.D. Wibowo, and E. F. Adi catio n model 
volution Neural Networks 
2022 Helmet detection using deep K.Kumar et al CNNs and the YOLOv4 
learning for motorcycle riders object detection algo- 
rithm 
2022 Helmet Detection and Track- Zhang etal. a helmet detection and 
ing for Motorcycle Riders using tracking system for 
Machine Learning motorcycle riders using 
machine learning 
3.4. OCR a scanned document, a photograph of a document, a 
scene photograph, or subtitle text superimposed on 
a picture. 
PREPROCESSING 
ros i Ty 3.4.1. Acquisition 
seaman = al pera is Obtaining non-editable text content from scanned 
documents of all types, from flatbed scans of cor- 
eT | porate archival material through to live surveillance 
.7 footage and mobile imaging data. 
3.4.2. Pre-processing 
post FEATURE 
PROCESSING EXTRACTION 


LP 


FIGURE 2. Optical Character Recognition 


CLASSIFICATION 


Optical character recognition or optical character 
reader is that the electronic or mechanical conver- 
sion of pictures of written, handwritten, or printed 
text into machine-encoded text, whether or not from 


Cleaning up the source imagery at an aggregate level 
so that the text is easier to discern, and noise is 
reduced or eliminated. OCR software often “prepro- 
cess” images to boost the chances of recognition. 


3.4.3. Segmentation and feature extraction 


Scanning of the image content for groups of pix- 
els that are likely to constitute single characters, and 
assignment of each of them to their own class. The 
machine learning framework will then attempt to 
derive features for the recurring pixel groups that 
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it finds, based on generalized OCR templates or 
prior models. However, human verification will be 
needed later. 

There are two main methods for extracting fea- 
tures in OCR: 

1. In the first method, the algorithm for feature 
detection defines a character by evaluating its lines 
and strokes. 

2. In the second method, pattern recognition 
works by identifying the entire character. We can 
recognize a line of text by searching for white pixel 
rows that have black pixels in between. Similarly, 
we can recognize where a character starts and fin- 
ishes. 


3.4.4, Training 


Once all features are defined, the data can be pro- 
cessed in a neural network training session, where 
a model will attempt to develop a generalized 
image>text mapping for the data. 


3.4.5. Verification and re-training 


After processing, humans evaluate the results, with 
corrections fed back into subsequent training ses- 
sions. At this point, data quality may need to 
be reviewed. Data cleaning is time-consuming 
and expensive, and while initial training runs will 
perform de-skewing, high contrast processing, and 
other helpful methods to obtain a good algorithm 
with minimal pre-processing, further arduous refine- 
ment of the data may be necessary. OCR accuracy 
can be improved if the output is limited by a lexi- 
con (a list of words permitted in a document). For 
instance, this could be all the words in English, or 
a more technical lexicon for a particular field. This 
method can be less efficient if the document con- 
tains words that are not in the lexicon, like proper 
nouns. Fortunately, to improve accuracy, there are 
OCR libraries available online for free. The Tesser- 
act library is using its dictionary to control the seg- 
mentation of characters 


4. Algorithms : 
4.1. R-CNN: 


Regions with CNN features (R-CNN) and its 
upgraded versions Faster R-CNN and Fast R-CNN 
are examples of algorithms. R-CNN uses a DL 
model to remove image attributes and create region 
proposals with a sliding window at start, but it has a 
lot of recurring calculations. Fast R-CNN uses the 
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SPP module to provide fixed-size output while inte- 
grating the regression and classification of bounding 
boxes into a network to diminish frequent calcula- 
tions. R-CNN inputs picture features into Region 
Proposal Networks (RPN) at a faster rate. The 
RPN be able to accept feature maps of several size, 
object candidate box confidence and output coordi- 
nate information, and then categorize the object can- 
didate boxes. In two-stage object detection model 
like Faster R-CNN, area classification and selection 
should be done step by step. With the advancement 
of deep learning, factors such as the number of can- 
didate boxes, the difficulty of basic network, the dif- 
ficulty of classification, and regression sub-networks 
affect two-stage detection algorithms and the quan- 
tity of calculation continues to rise. 


4.2. YOLO: 


In the algorithm, YOLO skips the candidate box 
extraction stage and instead uses an end-to- end deep 
convolution network to perform candidate box clas- 
sification, feature extraction, and regression. Safety 
helmet wearing detection is an important practi- 
cal application in object identification that is inti- 
mately tied to our production and daily lives. It 
has been the subject of extensive research by a 
number of academics. The authors in retrieved 
the features of helmet and worker from the image 
and used a cascade to determine if the worker was 
wearing a helmet. Based on pedestrian detection 
results. Used head positioning, color space trans- 
formation, and color feature recognition to recog- 
nize people wearing helmets. To improve feature 
resolution in helmet detection applied the upgraded 
YOLO-Dense backbone. To detect wearing helmets, 
authors in employed Single Shot multi-box Detector 
(SSD). However, current helmet detection systems 
have drawbacks, like limited generalization capac- 
ity for multi-scene detection and low detection accu- 
racy for small items. The author in employed the 
enhanced Faster-RCNN technique to detect helmet 
wearing after using the K- means++ clustering algo- 
rithm to cluster the helmet’s size in the image. How- 
ever, the one- stage detector has a faster detection 
speed among the present helmet-wearing detection 
algorithms, but its detection accuracy for dense and 
small targets is low, and its generalization capacity 
of diverse scenes is pathetic. Due to the enormous 
computation amount and sluggish detection speed of 
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FIGURE 3. Architecture diagram 


the two-stage detector, it is impossible to congregate 
the real time necessities of helmet detection. 


5. Proposed System : 


e Such as a group of the dataset, moving 
object detection, background subtraction, object 
classification using neural networks and extraction 
of number plate number if the rider is not wearing a 
helmet. 

e Used a KNN classifier for moving object extrac- 
tion and classification. Here the head is classified as 
wearing a helmet or not based on various features 
obtained from the segmented head region moving 
objects can be detected using adaptive background 
subtraction. 

e ViBe background modeling algorithm can also 
be applied to detect moving objects. The Canny 
edge detection algorithm is used to get segmented 
moving objects. 


6. System Requirements: 
6.1. Hardware Requirements: 
Hard disk : 80GB 

RAM : 8 GB (min), 16 GB or higher is recom- 
mended. 

Processor : 64 bit, four core ,2.5GHz minimum 
per core 

Monitor :15’Color Monitor 


6.2. Software Requirements: 


Operating System : Windows 10 and 11 (Intel/AMD 
64 bit) , 
Linux (Intel/AMD 64 bit kernel 


7. Architecture Diagram: 
Architecture diagram is shown in Figure 3. 


8. Result: 


When we give the input video wearing helmet, it 
successfully detects the helmet and shows the con- 
fidence score and also it prints “Helmet Detected!” 
on the console When the person is not wearing hel- 
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FIGURE 4. Output 


met the system searches for the number plate in 
the frame. Once detected it extracts characters and 
prints on the console. The result image is shown in 
Figure 4. 
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